pBMesterasel 



pBluescript II KS+ 



.BamHI 

| f stl .Hindlil ,BamHI 



EcoR I.Pstl 
EcoRVlBamHI 




Conversion of BamHI to Smal 



Smal 



pBMesterasel b 
FIG. 2 



pBMesteraselb 

EcoRI. 
EcoRVjPstl 




pBMesterase3 
FIG. 3 



Smal/ EcoRV esterase fragment 
from pBMesterase3 
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• 3.8 kb EcoRV/Smal fragment 
inserted into Smai site of pSJC62. 
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N -TERMINAL AMINO ACID SEQUENCE 



AMINO ACID SEQ. 



T N P N E P 



REV. TRANSLATION 



INVERSE 



ACX AAPy CCX AAPy GAPu CC 
GGPy TCPu TTX GGPu TTX GT 



PROBE 1 
2 
3 
4 



GGPy TCPu TTG GGPu TTX GT 
A 
T 
C 



Four 17-mer oligonucleotide probes each with a 32-fold degeneracy were 
synthesized from the N-terminal amino acid sequence and used to probe a 
Southern blot of R.toruloides DNA. 
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RHODOSPORIDIUM ESTERASE cDNA 



ATGCTCCTTAACCTCTTCACCCTCGCCTCCCTCGCTGCGACGCTCCAGCTCGCCTTTGCC 70 
MLLNLFTLASLAATLQLAFA 

TCTCCGACCTCCCTCGTCCGCCGCACGAACCCAAACGAGCCCCCTCCCGTCGTCGACCTC 130 
S PTSLVRRTN PNEPPPVVDL 

GGCTACGCCCGCTACCAAGGCTACTTGAACGAGACCGCCGGACTCTACTGGTGGCGCGGA 190 
GYARYQGYLNETAGLYWWRG 

ATCCGCTACGCCTCGGCTCAGCGCTTCCAGGCTCCTCAGACGCCCGCGACGCACAAGGCC 250 
IRYASAQRFQAPQTPATHKA 

GTCCGCAACGCGACTGAGTATGGACCGATCTGTTGGCCGGCTAGCGAGGGAACCAACACG 310 
VRNATEYGPICWPASEGTNT 

ACCAAGGGCTTGCCGCCGCCTAGCAACAGCTCGAGCAGCGCGCCGCAGAAACAGGCGTCG 370 
TKGLPPPSNSSSSAPQKQAS 

GAGGATTGCCTCTTCCTCAATGTCGTTGCCCCCGCCGGCTCGTGCGAGGGCGACAATCTT 430 
EDCLFLNVVAPAGSCEGDNL 

CCCGTCCTCGTCTACATTCACGGAGGTGGCTACGCCTTCGGCGATGCGAGCACCGGCAGC 490 
PVLVYIHGGGYAFGDASTGS 

GACTTTGCCGCCTTCACCAAGCACACGGGAACCAAGATGGTCGTTGTAAATCTCCAGTAC 550 
D FAAFTKHTGT KMVVVNLQY 

CGTCTCGGCAGCTTTGGTTTCCTCGCTGGCCAAGCCATGAAGGACTACGGTGTAACGAAC 610 
RLGSFGFLAGQAMKDYGVTN 

GCCGGCTTGCTTGACCAGCAATTCGCCCTTCAATGGGTTCAACAGCACGTCTCGAAGTTC 67 0 

AGLLDQQFALQWVQQHVSKF 

GGCGGCAACCCCGATCACGTTACGATTTGGGGCGAGTCTGCAGGCGCAGGGTCCGTTATG 730 
GGNPDHVTIWGESAGAGSVM 

AACCAGATCATTGCGAACGGCGGCAACACCGTCAAGGCTCTCGGTCTCAAGAAGCCCCTC 790 
NQI IANGGNTVKALGLKKPL 

TTCCACGCTGCCATCGGCTCCTCCGTCTTCCTCCCCTACCAAGCCAAGTACAACTCCCCC 850 
FHAAIGSSVFL PYQAKYNS P 

TTCGCCGAGCTGCTCTACTCCCAACTCGTCTCGGCGACAAACTGCACCAAAGCCGCCTCG 910 
FAELLYSQLVSATNCTKAAS 

TCCTTCGCTTGCCTCGAAGCTGTCGACGCTGCGGCGCTCGCTGCGGCGGGCGTGAAGAAC 970 
S FACLEAVDAAALAAAGVKN 

TCGGCGGCGTTCCCGTTCGGGTTTTGGTCGTATGTCCCGGTCGTCGACGGGACCTTCTTG 1030 
SAAFPFGFWSYVPVVDGTFL 
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ACTGAGCGCGCGTCGCTCCTTCTCGCCAAGGGCAAGAAGAACCTCAATGGCAACCTCTTC 1090 
TERASLLLAKGKKNLNGNLF 

ACCGGGATCAACAACCTCGACGAAGGATTCATATTCACTGACGCCACTATTCAGAACGAC 1150 
TGINNLDEGFIFTDATIQND 

ACGATCAGCGACCAGTCGCAGCGCGTCTCCCAGTTCGACCGCCTCCTCGCCGGCCTCTTC 1210 
TIS D Q S QRVSQ FDRLLAGL F 1 

CCCTACATCACCTCGGAGGAGCGCCAGGCCGTCGCGAAGCAGTACCCGATCTCCGACGCG 1270 
PYITSEERQAVAKQYPISDA 

CCGTCAAAGGGCAACACCTTCTCTCGCATCTCGGCCGTCATCGCGGACTCGACCTTCGTC 1330 
PSKGNTFSRISAVIADSTFV 

TGCCCGACCTACTGGACCGCCGAGGCGTTCGGCTCGTCCGCCCACAAGGGCCTCTTCGAC 1390 
CPTYWTAEAFGSSAHKGLFD 

T ACGCGCCGGCTCACCACGCGACCGACAACTCGTACTACATCGGCTCCATCTGGAACGGC 1450 
YAPAHHATDNSYYIGSIWNG 

AAGAAGTCGGTCTCGTCCGTCCAGTCCTTCGACGGCGCGCTCGGCGGCTTCATCGAGACG 1510 
KKSVSSVQSFDGALGGFIET 

TTCAACCCGAACAACAACGCTGCCAACAAGACCATCAACCCTTACTGGCCGACGTTCGAC 1570 
FN PNNNAANKT INPYWPTFD 

TCGGGCAAGCAGCTCCTCTTCAACACGACGACGAGGGACACCCTCTCTCCCGCCGACCCG 1630 
SGKQLLFNTTTRDTLSPADP 

CGCATCGTTGAGACTTCAAGCTTGACCGACTTTGGCACGAGCCAGAAGACCAAGTGCGAC 1 690 
RIVETSSLTDFGTSQKTKCD 

TTCTGGCGTGGGTCAATCTCGGTGAACGCGGGTCTC 1726 
FWRGS I SVNAGL 
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GGATCC ACCCGAACTCTGTCCCGCTTTCTGGCTTTCTTCCTTGCTGTCGCCCCATCGCCT g 0 

J-- Translation Start — > 
TTCCCG ACTCGCCGCCATGCTCCTTAACCTCTTCACCCTCGCCTCCCTCGCTGCG ACGCT 120 
MLLNLFTLASLAATL 

J- Mature peptide -> 
CCAGCTCGCCTTTGCCTCTCCGACCTCCCTCGTCCGCCGCACGAACCCAAACGAGCCCCC 180 
QLAFASPTSLVRRTNPNEPP 

TCCCGTCGTCGACCTCGGCTACGCCCGCTACCAAGGCTACTTGAACGAG ACCGCCGGACT 240 
PVVDLGYARYQGYLNETAGL 

CTACTGGTGGCGCGGAATCCGCTACGCCTCGGCTCAGCGCTTCCAGGCTCCTCAGACGCC 300 
YWWRGIRYASAQRFQAPQTP 

CGCGACGCACAAGGCCGTCCGCAACGCGACTGAGTATGGACCGATCTGTTGGCCGGCTAG 3 60 
ATHKAVRNATEYGP I CWPAS 

CGAGGGAACCAACACGACCAAGGGCTTGCCGCCGCCTAGCAACAGCTCGAGCAGCGCGCC 420 
EGTNTTKGLPPPSNS SSSAP 

GCAGAAACAGGCGTCGGAGGATTGCCTCTTCCTCAATGTCGTTGCCCCCGCCGGCTCGTG 480 
QKQASEDCLFLNVVA PAGSC 

CG AGGGCGACAATCTTCCCGTCCTCGTCTACATTCACGGAGGTGGCTACGCCTTCGGCGA 540 
EGDNLPVL.VYIHGGGYAFGD 

TGCGAGCACCGGCAGCGACTTTGCCGCCTTCACCAAGCACACGGGAACCAAGATGGTCGT 600 
ASTGSDFAAFTKHTGTKMVV 

TGTAAATCTCCAGTACCGTCTCGGCAGCTTTGGTTTCCTCGCTGGCCAAGCCATGAAGGA 660 
VNLQYRLGSFGFLAGQAMKD 

I Intron il 

CTACGGTGTAACGAACGCCGGCTTGCTTGACCAGGTGAGTTTCCCGCATGATACCCGCCC 720 
YGVTNAGLLDQ 

" J 

ACCTTTCGACTCATGCTG ACGCCTCTCCCGCTCGCAGCAATTCGCCCTTCAATGGGTTCA 780 

QFALQWVQ 

ACAGCACGTCTCG AAGTTCGGCGGCAACCCCGATCACGTTACG ATTTGGGGCGAGTCTGC 840 
QHVSKFGGNPDHVTIWGESA 

[ Intron 12 

AGGCGCAGGGTCCGTTATGAACCAGATCATTGCGAACGTGAGCCACCCGAACCGATCTCC 900 
GAGSVMNQIIAN 




960 
1020 



VKALGLKKPLFHAA IGSSVF 



TCCTCCCCTACCAAGCCAAGTACAACTCCCCCTTCGCCGAGCTGCTCTACTCCCAACTCG 1080 
LPYQAKYNSPFAELLYSQLV 

FIG. 9A 



CTGCGGCGCTCGCTGCGGCGGGCGTGAAGAACTCGGCGGCGTTCCCGTTCGGGTTTTGGT 1200 
AALAAAGVKNSAAFPFGFWS 

CGTATGTCCCGGTCGTCGACGGGACCTTCTTGACTGAGCGCGCGTCGCTCCTTCTCGCCA 1260 
YVPVVDGTFLTERASLLLAK 

t Intron #3 

AGGGCAAGAAGAACCTCAATGGCGTGCGTGGCGAGCTTTCGAGTGCTTCAGGATCTCGCT 1320 
G K K N L N G 

1 r — 

GACACTGTCGACCGGCTCGCAGAACCTCTTCACCGGGATCAACAACCTCGACGAAGATGA 1380 
NLFTGINNLDEG 



1440 



ATTCATATTCACTGACGCCACTATTCAGAACGACACGATCAGCGACCAGTCGCAGCGCGT 1500 
FIFTDATIQNDTISDQSQRV 

CTCCCAGTTCGACCGCCTCCTCGCCGGCCTCTTCCCCTACATCACCTCGG AGGAGCGCCA 1560 
SQFDRLLAGLFPYITSEERQ 

GGCCGTCGCGAAGCAGTACCCGATCTCCGACGCGCCGTCAAAGGGCAACACCTTCTCTCG 1620 
AVAKQYPI SDAPSKGNTFS R 

[ Intron #5 

CATCTCGGCCGTCATCGCGGACTCGACCTTCGTGTGCGTTCCCCGTCGTCTTCTCCGAGT 1680 
ISAVIADSTFV 

ATTCCGCTGACTTCCCGCTTGCCCGCAGCTGCCCGACCTACTGGACCGCCGAGGCGTTCG 1740 
CPTYWTAEAFG 

GCTCGTCCGCCCACAAGGGCCTCTTCGACTACGCGCCGGCTCACCACGCGACCGACAACT 1800 
SSAHKGLFDYAPAHHATDNS 

CGTACTACATCGGCTCCATCTGGAACGGCAAGAAGTCGGTCTCGTCCGTCCAGTCCTTCG 1860 
YYIGSIWNGKKSVSSVQSFD 

ACGGCGCGCTCGGCGGCTTCATCGAGACGTTCAACCCGAACAACAACGCTGCCAACAAGA 1920 
GALGGFIETFNPNNNAANKT 

CCATCAACCCTTACTGGCCGACGTTCX3ACTCGGGCAAGCAGCTCCTCTTCAACACGACGA 1980 
INPYWPTFDSGKQLLFNTTT 

CGAGGGACACCCTCTCTCCCGCCGACCCGCGCATCGTTGAGACTTCAAGCTTGACCGACT 2040 
RDTLSPADPRIVETSSLTDF 

TTGGCACGAGCCAGAAGACCAAGTGCGACTTCTGGCGTGGGTCAATCTCGGTGAACGCGG 2100 
GTSQKTKCDFWRGSISVNAG 

GTCTCTAGGCGTCTTTCCTTCCGACTTCCTTCGTTCTTTCGTTGTTTATTC 2160 
L * 

CGTTGTATCGGCCATTCGTGCGTGTAGCTCACTCGAGTATAGACGTTGGCAAGTGCGAAA 2220 
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TRN 2-1738 RHODOSPORIDIUM ESTERASE cDNA$ 
TTranslation Start TMature Peptide 

MLLNLFTLASLAATLQLAFAS PT S LVRRTN PNEP P P WDLG YARYQGYLNETAGLYWWRG 
IRYASAQRFQAPQTPATHKAVRNATEYGPICWPASEGTNTTKGLPPPSNSSSSAPQKQAS 
EDCLFLNWAPAGSCEGDNLPVLVYIHGGGYAFGDASTGSDFAAFTKHTGTKMVWNLQY 
RLGSFGFLAGQAMKDYGVTNAGLLDQQFALQWVQQHVSKFGGNPDHVTIWGESAGAGSVM 
NQ I 1 ANGGNT VKALGLKKPL FHAAI GS S VFL P YQAKYN S P FAELL Y SQLVS ATNCTKAAS 
SFACLEAVDAAALAAAGVKNSAAFPFGFWSYVPWDGTFLTERASLLLAKGKKNLNGNLF 
TGINNLDEGFIFTDATIQNDTISDQSQRVSQFDRLLAGLFPYITSEERQAVAKQYPISDA 
PSKGNTFSRISAVIADSTFVCPTYWTAEAFGSSAHKGLFDYAPAHHATDNSYYIGSIWNG 
KKSVSSVQSFDGALGGFIETFNPNNNAANKTINPYWPTFDSGKQLLFNTTTRDTLSPADP 
RIVETSSLTDFGTSQKTKCDFWRGSISVNAGL* 
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Amino acid composition from 1 to 572 
TKN 2-1738 RHODOSPORIDIUM ESTERASE cDNA 



Total 


Percent 


A 


67 


11.7 


C 


7 


1.2 


D 


25 


4.4 


E 


16 


2.8 


F 


35 


6.1 


G 


49 


8.6 


H 


9 


1.6 


I 


21 


3.7 


K 


25 


4.4 


L 


48 


8.4 


M 


4 


0.7 


N 


35 


6.1 


P 


31 


5.4 


Q 


26 


4.5 


R 


16 


2.8 


S 


52 


9.1 


T 


43 


7.5 


V 


32 


5.6 


W 


10 


1.7 


Y 


21 


3.7 


Acidic 


41 


7.2 


Basic 


41 


7.2 


Charged 


82 


14.3 


Net charge 


0 


0.0 


Hydrophobic 


136 


23.8 



Residues 572 
MW 61334 
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